A Latent Source Model for Online Time Series Classfication

نویسندگان

  • Zhang Zhang
  • Aravind Srinivasan
چکیده

We study a binary classification problem with infinite time series having more than two labels (”event” and ”nonevent” or ”trending” and ”non-trending”). We want to predict the label of the time series given some training data set. Intuitively, the longer we wait, the longer the time series we can observe so that the prediction is more accurate. However, in many applications, such as predicting which topic will go popular in a social network or revealing an imminent market crash, making a prediction as early as possible is highly valuable.Motivated by these applications, we look into a latent source model which is a nonparametric model to predict the binary status of a time series. Our main assumption is that these time series only have a few ways to reach the binary status such as Twitter topic going trending online. The latent source model naturally leads to a weighted majority voting as the classification rule without knowing the latent source structure. In the project: 1. We will investigate the theoretical performance guarantees of the latent source model; 2. We will implement the model by programming language C; 3. We will investigate the strategy to estimate the values of different parameters; 4. We will test our implementation and use the model to predict which news topics on Twitter will go viral to become trends and analyze the results.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Latent Source Model for Online Time Series Classification

In supervised classification, one attempts to learn a model of how objects map to labels by selecting the best model from some model space. The choice of model space encodes assumptions about the problem. We propose a setting for model specification and selection in supervised learning based on a latent source model. In this setting, we specify the model by a small collection of unknown latent ...

متن کامل

Latent source models for nonparametric inference

Nearest-neighbor inference methods have been widely and successfully used in numerous applications such as forecasting which news topics will go viral, recommending products to people in online stores, and delineating objects in images by looking at image patches. However, there is little theoretical understanding of when, why, and how well these nonparametric inference methods work in terms of...

متن کامل

Online Streaming Feature Selection Using Geometric Series of the Adjacency Matrix of Features

Feature Selection (FS) is an important pre-processing step in machine learning and data mining. All the traditional feature selection methods assume that the entire feature space is available from the beginning. However, online streaming features (OSF) are an integral part of many real-world applications. In OSF, the number of training examples is fixed while the number of features grows with t...

متن کامل

Prediction of the Type and Amount of Surface Water Pollutants using Time Series Models (ARIMA) and L-THIA Model (Case Study: Namrood Sub-Basin, Hablehrood Watershed)

     Due to the important role of non-point source pollution in water resources management, in this study time series modeling was applied to forecast water quality parameters and L-THIA model (one type of non-point source pollution models) was applied to estimate water pollutants. The purpose of this study was to compare results of L-THIA model and ARIMA models in Namrood sub-basin located in ...

متن کامل

Role of Internet Dependency on Online Social Capital among Graduate Students in University of Putra Malaysia

This study examined to study how respondents rely on the Internet to fulfill the various life goals dimensioned into understanding, orientation and playing goals, and how this dependency relates to the generation of social capital. Further, it examined the resources of social capital in terms of bonding social capita and bridging social capital. In this study quantitative research approach was ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2013